Resequencing and Clustering to Improve the Performance of Spatial Joins
نویسندگان
چکیده
The lter-and-reene strategy is well-established as the basis for spatial join algorithms. In contrast to the lter step, the reenement step has received little attention, despite contributing signiicantly to the total cost of a join evaluation. Sorting candidate tuples as produced by the lter step has recently been shown to reduce the I/O cost of reenement. Our paper reports investigations of spatial join algorithms, with particular emphasis on interactions between choices of algorithms for the lter, sequencing and reenement steps and on the eeects of clustered and unclustered organization of full spatial descriptions of objects. We extend existing techniques by caching spatial descriptions and a new sorting strategy, called zigzag. Our experiments connrm that the choice of the sequencing strategy used is very important and that clustering has a signiicant innuence on join performance. Clustering by z-ordered keys is shown to be superior to ordering by minimum bounding rectangles.
منابع مشابه
Assessment of the Performance of Clustering Algorithms in the Extraction of Similar Trajectories
In recent years, the tremendous and increasing growth of spatial trajectory data and the necessity of processing and extraction of useful information and meaningful patterns have led to the fact that many researchers have been attracted to the field of spatio-temporal trajectory clustering. The process and analysis of these trajectories have resulted in the extraction of useful information whic...
متن کاملThe Impact of Global Clustering on Spatial Database Systems
Global clustering has rarely been investigated in the area of spatial dambase systems although dramatic performance improvements can be achieved by using suitable techniques. In this paper, we propose a simple approach to global clustering called cluster organization. We will demonstrate that this cluster organization leads to considerable performance improvements without any algorithmic ovedxa...
متن کاملNew Approach for Customer Clustering by Integrating the LRFM Model and Fuzzy Inference System
This study aimed at providing a systematic method to analyze the characteristics of customers’ purchasing behavior in order to improve the performance of customer relationship management system. For this purpose, the improved model of LRFM (including Length, Recency, Frequency, and Monetary indices) was utilized which is now a more common model than the basic RFM model apt for analyzing the cus...
متن کاملNew spatial clustering-based models for optimal urban facility location considering geographical obstacles
The problems of facility location and the allocation of demand points to facilities are crucial research issues in spatial data analysis and urban planning. It is very important for an organization or governments to best locate its resources and facilities and efficiently manage resources to ensure that all demand points are covered and all the needs are met. Most of the recent studies, which f...
متن کاملProposing a Novel Cost Sensitive Imbalanced Classification Method based on Hybrid of New Fuzzy Cost Assigning Approaches, Fuzzy Clustering and Evolutionary Algorithms
In this paper, a new hybrid methodology is introduced to design a cost-sensitive fuzzy rule-based classification system. A novel cost metric is proposed based on the combination of three different concepts: Entropy, Gini index and DKM criterion. In order to calculate the effective cost of patterns, a hybrid of fuzzy c-means clustering and particle swarm optimization algorithm is utilized. This ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996